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Abstract: In contrast to previous analyses, we demonstrate a Bayesian approach to the 
estimation of the CKM phase a that is invariant to parameterization. We also show that in 
addition to computing the marginal posterior in a Bayesian manner, the distribution must 
also be interpreted from a subjective Bayesian viewpoint. Doing so gives a very natural 
interpretation to the distribution. 

We also comment on the effect of removing information about B 00 . 
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1. Introduction 

A number of papers have been published recently that form a lively debate about the 
nature of inference in particle physics in general, and in the extraction of the CKM phase 
a from measured branching ratios and asymmetries in particular (see e.g. and references 
therein for theoretical motivations and recent experimental results). 

The first paper, Charles et al., (2|, proposed several different parameterizations of the 
CKM phase a problem and showed, in their formulation, that different parameterizations 
resulted in different posterior marginal distributions for a. These different distributions 
were held to be the result of using flat priors in the different parameterizations. The 
interpretation of p(a) in Charles et al. also claimed that it did not correctly identify the 
8 known mirror solutions to the CKM phase a problem. Charles et al. also provided a 
simple 2-dimensional problem which they claimed showed similar features. 

Charles et al. is a criticism of the approach taken by the UTfit collaboration ||], and 
Bona et al. replied in In this paper the emphasis is shifted from full distributions over 
a to 95% probability regions, which are shown to be very similar to the 95% confidence 
intervals given in Charles et al.. Bona et al. also note that the identification of the 8 
modes in the 1-CL plot of Charles et al. is not robust to slight changes in the values of 
the observables, and that, in practice there is plenty of information regarding the hadronic 
amplitudes which can (and should) be used to remove some of the degeneracy. 

Charles et al. replied in ||, criticizing the change of emphasis from p(a) to 95% prob- 
ability intervals as being an admission that the approach of Bona et al. has significant 
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dependence on the parametrization chosen. They also repeated their criticism that the 
Bayesian marginal posterior, p(a) does not show the expected 8-fold ambiguity. 

A paper by Botella and Nebot || took another approach, noting that some param- 
eterizations used in the analysis of the CKM phase a problem are inadequate if they go 
beyond the minimal Gronau and London assumptions [0]. In particular, the "modulus and 
argument" (MA) and "real and imaginary" (RI) parameterizations of Charles et al. were 
shown to not uniquely identify a in the parameterization, leading to the leaking of spuri- 
ous information into p(a). Botella and Nebot identified which parameterizations do not 
suffer from this problem. They also, however, concentrated on probability regions, though 
they came tantalizingly close to giving the correct Bayesian interpretation of p(a) in their 
appendices C and E. 

In this paper we will show how to perform a Bayesian analysis of the problem that 
results in the same p{a) for any parameterization. We also show how regarding p{a) as a 
Bayesian subjective distribution, i.e. one that describes our state of knowledge, allows it to 
be correctly interpreted in a straightforward manner - it is not sufficient just to use Bayes 
Theorem to perform computation, the result of that computation must also be interpreted 
from the Bayesian perspective. 

We begin by reconsidering the simple 2-dimensional problem with mirror solutions of 
Charles et al. as it is illustrative of some of the main points we wish to make. 

2. Mirror Solutions in a Simple 2D Problem 

The problem, from section VIII of Q, is presented as "a theory predicts the expressions of 
two observables X and Y as functions of the two parameters a and fi": 

X = (a + /i) 2 

Y = fi\ (2.1) 

where "an experiment has measured the observables from a Gaussian sample of events" 
with the results: 

X = 1.00 ±0.07 

Y = 1.10 ±0.07. (2.2) 

In terms of the assumed physics, only a is of interest. 

It is important even at this early stage of the analysis to be clear regarding what is 
considered an "observable" , what is considered a "parameter" , and what is meant by saying 
that an observable has a distribution, or that a parameter has a distribution. Observables 
are expected to have values that vary with different experimental data sets, and saying 
that an observable has a distribution quantifies the uncertainty due to a particular data 
set. Saying that a parameter has a distribution is a Bayesian concept, indicating that there 
is actually a true, fixed, value, and that the distribution represents our state-of-knowledge 
regarding what that value might be. 
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This distinction is often somewhat artificial, however. Typically the quantities labeled 
as observables are not actually observed directly, instead they are themselves inferred from 
observed data. Different data sets will give different distributions over the observables and, 
consequently in the Bayesian framework, different distributions over the parameters. In 



equation 2.2, for example, the means and variances for X and Y are the summary results 
of a particular data set. 

The standard approach to computing a joint Bayesian posterior distribution for a and 
/x is to use equations ( [2,1| ) and (|2.2| ) to define a likelihood, and then to combine it with a 
prior, p(a,[i), on a, //, giving 

Pa^a^ld) oc — — - exp g_ L _ ^ J j p ( af/i ) (2.3) 

where d denotes the experimental data and X, Y, ax and cry are derived by considering the 
full expression for the likelihood over the individual measurements. They are all functions 
ofd 12 . 

This formulation is subject to the standard criticism that different parameterizations 
require different priors - if, for example, we were to parameterize the problem by a, // where 
// = fj?, then clearly flat priors on /j, and // will result in different posterior distributions 

The discussion of observables and parameters above motivates an alternative Bayesian 
analysis, one that results in a posterior distribution that is invariant to the parameterization 
chosen. In this analysis we first use the observed data to obtain a posterior distribution 
over X and Y. This requires a prior on the observables, and yields 

1 . ( {x-Xf -(y-Yf 



p^ x >yw « ^^j-p [—2% 2^— ) p{x > y) - {2A) 

Placing priors in the space of observables is reasonable: it is here that the experimenter 
will typically have good prior knowledge - prior knowledge that determined the design of 
the experiment. 

The physical parameters of interest, a, are related to X, Y by the deterministic 
relationships in equation (|P|). The distribution p a ^(a, fi\d) is thus computed by the 
change of variables rule. When the posterior for a, [i is computed in this way, the general 
result in Appendix |A| can be used to show that the resulting posterior marginal distribution, 



1 This simple for of the likelihood is a result of the assumed Gaussian errors. In general, it will not be 
expressible in terms of summary statistics. 

2 Conditioning explicitly on the data, ck,i = 1.. .Nd Charles et al.'s "Gaussian sample of events", gives 



t— r i / — ix — di] 

(x\d) oc p(x) exp f 

f = l V27K7 e 



2(7? 



It is well known that the product of two Gaussians has variance less than either of the two. As a consequence 
p(d\x) becomes steadily more peaked as more data is collected (Nd increases). The prior p(x) does not 
change. Thus, contrary to what is claimed in Charles et al., it is often simple to show that "the relative 
prior dependence of the posterior distribution is reduced as the statistical information from the measured 
data is increased". 
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Figure 1: Left : The posterior distribution of fi, a; Right : the marginal distribution p a (a \d). 

p a {a\d) is invariant with respect to the chosen parameterization of the other variables (in 
this case, fi). 

Changing variables gives 



Pa^ipt, lAd) oc px,y(x(a, fi), y(a, n)\d) 



d(X,Y) 



resulting in 

Pa,ti{0L, fj,\d) oc 



2lTO'xO'Y 



exp 



d{a,n) 



24 



24 



(2.5) 



(2.6) 



on the assumption of a flat prior p(x,y), and where the factor of 4 is removed because of 
the multiple solutions. This is plotted in figure |l[ 

Comparing equation ( |2.6| ) with equation (|2.3| ) it is clear that this transformation of 
variables formulation is equivalent to using the prior 



In this problem it is straightforward to show that the Jeffrey's prior ||, given by ^J\I(a, 
where I() is the Fisher Information matrix, is also proportional to |^(a + /u)|. The Jeffrey's 
prior is the prior that is invariant to transformation of the variables. Thus, computing a 
posterior px t Y( x ,y\d) using a uniform prior on X and Y followed by a transformation of 
variables to give p a ^(a, /i\d) is equivalent o using a Jeffrey's prior on a, fi. 

While figure (left) looks very similar in projection to figure 5 in H, note, however, 
that the modes of p a ^{a,n) are not located at the values of a that were found by sub- 
stituting the mean values X and Y into equations ( |2.1| ). They are shifted because of the 
presence of the term |//(a + in the expression for P a ^(a,[i) in equation ( p.6| ), coming 
from the determinant of the Jacobian of the transformation from X, Y to a, /i. In this case 
the displacement of the modes is small; it is not visible in figure |l|. This need not be the 
case in general, and indeed is not the case for the CKM phase a problem. See section ||. 

The simplest way to form the marginal distribution p(a) is to generate samples from 
the distributions of X and Y, to transform these samples into samples of a and fj,, and 
then to plot a histogram of the samples of a [10]. In this case we generate samples Xi <— 
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A/"(1.00, 0.07) and <— AA(1.10, 0.07), i = 1 ... N for some suitably large N, and from each 
pair (xi,yi) we find the four solutions for (ai,fJLi), namely 

Hi = ty^fVi (2.7) 

where e x = ±1, = ±1 and each of the four (ctj,/^) pairs is given weight 1/4 3 . 

In the right panel of figure || we plot the marginal distribution p a (a\d), which is 
very similar to figure 6 (bottom) from Charles et al.. In their discussion of this figure, 
Charles et al. state that "if a and ^ are fundamental physics parameters, Nature can only 
accommodate a single pair of values", and criticize the Bayesian approach by saying that 
the marginal p a (a\d) only has 3 peaks, with the peak at zero being higher than the other 
two. This is an incorrect interpretation of the distribution. This distribution is in fact 
exactly right when interpreted as a Bayesian subjective distribution, as representing our 
state of knowledge. Nature has chosen one of the four modes visible in the joint distribution 
Pa,/j,{ a : A 4 )- We do not know which one. On the basis of our knowledge, there are two 
chances out of four that Nature has chosen a « 0, so our state of knowledge is exactly 
that a ~ is twice as likely as a ~ -2 or a ~ 2. This is precisely what is shown by the 
distribution in the right panel of figure |], where the central mode has twice the area of 
each of the other two modes. 

This simple problem has illustrated two of the key points we wish to make, namely 
that the posterior distribution must be interpreted in a subjective Bayesian manner, and 
that the posterior distribution in this type of problem can be found by putting priors in the 
space of observables, and then using the transformation of variables rule to compute the 
distribution over the parameters derived from the observables. The simple problem is not 
rich enough to clearly demonstrate that this approach also leads to posterior distributions 
for a which are independent of the parameterization chosen. To do this, we turn now to 
the full CKM phase a problem. 



3. Extracting the CKM Phase a 

There are six observable parameters involved in the CKM Phase a problem, three CP 
averaged branching fractions, £> H , B +0 , B 00 , the direct CP asymmetries C" 1 and C 00 , 
and the B°B° mixing-induced CP asymmetry, S~* . These have been recently measured 
by the B-factory experiments BaBar and Belle [jl], |ll[ . 

The general formula for the branching ratio of a 2-body decay of a meson B can be 
found in |l2[ (eqs. 38.16 and 38.17). Specializing to a final state of light mesons, and 



3 Note, however, that with finite probability some of the samples Xi and/or j/i will be negative, resulting 
in imaginary values for fii and/or complex values for oti. This is not a problem with probability theory. 



What it indicates is that the Gaussian distributions in equations (2.2) are only approximations to the true 
distributions of X and Y. 
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Observable 


B+- 


B+° 


#00 


Mean±std 


(5.1 ±0.4) x 10" 6 


(5.5 ±0.6) x 10~ 6 


(1.45 ±0.29) x 10" 6 


Observable 


c+- 




S+~ 


Meanistd 


-0.37 ±0.10 


-0.28 ±0.40 


-0.50 ±0.12 



Table 1: World average values for the observables, from 



averaging over CP-eigenstate yields: 

T$ j \A^\ 2 + \A^\ 2 
16tt M B h 2 
\A^\ 2 - \A l i\ 2 
\A i i\ 2 + \ A i i\ 2 
2Im(A+-A+-*) 
\A+~\ 2 + \A+-\ 2 ' 

The decay amplitudes can be parameterized in a number of ways. Here we will consider 
three parameterizations, the Pivk-LeDiberder (PLD) and Explicit Solution (ES) parame- 
terizations considered in Charles et al. and the so-called li parameterization from Botella 
and Nebot. These vary in how they parameterize A 13 and A 13 , but all include a explicitly 
as one of the parameters. Details of the parameterizations are given in appendix [B|. 

Denote the parameterizations as (a, 4>pld), (&, 4>es) and (a, (f>u), where (fipLD denotes 
the other five parameters of the PLD parameterization, and similarly for <j>ES and <f>u. 
Denote by O the set of six observables, , B 00 , £> +0 , , C 00 and . Then we have 

O = f{a,(j)p LD ) 
= g(a,4>Es) 
= h(a,<f>u) , 

where the functional forms of /(), g() and h() can be derived from the parameterizations 
given m Appendix ||. Table @ gives the values for the observables and their uncertainty 
that are used in this work 4 . Using a uniform prior in the space of observables, these define 
a multivariate Gaussian posterior, p(0\d) where d is the experimental data. 
Using the change-of-variables formulation gives 



B ij = 
C ij = 

S+~ = 



PPLD{a,4>PLD) = p(f{a,4> PLD )\d)\Jf 



and the marginal distribution for a is given by 

Ppld{u) = p(f(a,<j>PLD)\d)\Jf\d<f>PLD- (3.1) 

J<t>PLD 

4 The values for the observables given in Table [l] are those used in Q , as we wish to compare our method 
with theirs. Subsequent improved measurements result in the distributions only having four modes. See 
Appendix B of @. 
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a a 

Figure 2: The first three plots show the marginal posterior distributions for a under the PLD, 
ES and li parameterizations generated by inverting the systems. The short vertical red lines on 
the top left plot indicate the central values obtained by Legends indicate the tuple of signs 
corresponding to each mode. The final plot is of samples generated from the PLD parameterization 
using the tempered transitions MCMC scheme. Binning in a is identical for all figures. In the first 
three plots, a sample of 100000 sets of observables is drawn, and the choices of signs, as indicated 
by the legends, allows each mode to be determined separately. As a result, the sum histograms 
have 800000 non independent entries. The fourth histogram is of size 100000. 



Similarly 

pu(a) = I p(h(a,(f)ii)\d)\J h \d(j)ii. (3.2) 



In appendix |A| we show that under reasonable conditions these marginal distributions 
are identical, i.e. that the marginal posterior distribution for a is independent of the chosen 
parameterization. This should not be surprising - the same information on the same 
observables gives the same information about the same physical parameter. 

In figure |2] we plot histograms representing the three marginal posterior distributions. 
The samples were generated by sampling the observables and inverting the systems 5 . 
As expected, the three histograms are essentially identical. We also show a histogram 
of samples generated using the PLD parameterization and a Markov chain Monte Carlo 
algorithm [13|. As expected, the histogram is the same as the others. It is included 



to demonstrate that our approach is not restricted to cases where the system can be 
inverted. Care must be taken in choosing the MCMC scheme, as the distribution is strongly 



5 If we choose to use non-flat priors on the observables., then we can generate samples representing the 
distribution p(a, <f>) by generating samples from the observables, weighting each sample by the prior, and 
then re-sampling the set of weighted samples to give samples from the posterior. See jl^] for details. 
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multimodal. We used the simulated tempering scheme of [14] which successfully sampled 
the 8 modes of the distribution. 

If we consider the modulus-and-argument (MA) parameterization & equation 4], we 
will not, however recover the same distribution for a. The determinant of the Jacobian 
for the MA system is identically zero for a = 0, and this results in a spurious zero in 
probability at a = 0, the remainder of the distribution being identical to our figure [2|. This 
adds to the discussion in 1J that the MA parameterization, by going beyond the minimal 
Gronau and London assumptions, is unsuited to the analysis of this problem. 

The histograms generated by inverting the systems are clearly composed of 8 modes, 
one for each of the 8 solutions. (There are two modes that overlap almost totally around 
a ~ 140°.) By construction, each of these modes has equal probability mass (=1/8), 
even though they are different shapes; the heights and widths vary, but the area beneath 
each mode is the same. Each possible solution for a has different uncertainty (due to the 
complex relationship between a and the observables), but each mode has equal probability 
to be the one chosen by Nature 67 . 

The final marginal distribution is the sum of these 8 modes, which is plotted as the 
dotted line. This shows a large peak around a = 140° and a number of smaller peaks. 
Again, this distribution correctly describes our state of knowledge - there are 2 of the 8 
modes near a = 140° and, because we don't know which mode Nature has chosen, there 
are thus 2 chances out of 8 that a ~ 140°. There is only 1 chance out of 8 that a ~ 80°, so 
the peak there has half the area of the peak at a = 140°. This accurately represents our 
state of knowledge about a. 

Also shown on figure ^ are short vertical lines marking the values of a that are found 
when the mean values for the observables are transformed into the different parameter- 
izations. Again, it comes as no great surprise that the mean of the distribution of the 
inputs is not transformed to the mean of the distribution of the output, especially when 
the uncertainty on some of the variables is of the same order as the value itself, and the 
system of equations is highly nonlinear 8 . This also naturally explains why there is still 
finite probability density that a = 0/180°. 

As the methodology presented in this work relies on the one-to-one relationship (up to 
discrete ambiguities) between the observables {£> H , £> +0 , £> 00 , C H , C 00 , 5" 1 } and the 



6 The reader is reminded that we are reconsidering the case discussed in Charles et al.. A complete 
analysis of the CKM phase a problem would include additional information which would break the symmetry 

In this case, and in the 2d problem in section Kl it is known by construction that each mode contains 
the same proportion of the total probability (1/4 for each mode in the 2d problem and 1/8 for the CKM 
phase a problem). In general, however, this may not be known in advance. Using a numerical search 
routine with random restarts can be used to locate the modes, and the Hessian, H, at each mode can be 
computed. (Often this will be computed as a by-product of the numerical optimization.) The probability 
volume in each mode can be approximated by p(9) / \y 'det(H /2n) where 8 are the parameters at the mode 
fll6| . Alternatively, samples generated without knowing how many modes are present (e.g. by using the 
tempered transitions MCMC scheme) can be clustered, and the number of samples in each cluster gives a 
measure of the probability volume in that mode. 

8 We note, however, that as the variances of the observables are reduced, the mean values remaining 
fixed, that the modes do converge to the values given by inverting the mean values. 
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Figure 3: Left: Posterior distribution for the 'li' parameterization when C and B are uniformly 
sampled in [-1, +1] and [0, 20B 00 ], respectively. Sampling is identical to figure ^. Right: Joint 
distribution of B 00 and C 00 implied by the observations and the assumption of isospin symmetry. 

underlying isospin amplitude representation, the analysis of the case when /3 00 and C 00 are 
not measured is not in general possible, once the system has been inverted. For instance, 
although the PLD representation presents the very appealing feature that a appears in the 
system ( |B.2| ) only in the expressions for £> 00 and C 00 , and therefore cannot be determined 
when the latter are not measured, this feature is not obvious anymore in the inverted system 
( B.3[ ). This is equivalent to the fact, already mentioned in Botella and Nebot (section C.l), 
that {B 00 , C 00 } are algebraically constrained by any set of measurements {B^ , B +0 , C H , 
} and the assumption of isospin symmetry. As noted by Botella and Nebot, sampling 
C 00 uniformly between —1 and +1, and B 00 between and B max , results in a distribution 
that is much natter than those shown in figure This distribution does not, however, 
become flat as B max — > oo, because ultimately the shape of the underlying single mode 
distributions will be driven by the algebraic constraints from the isospin assumption and by 
the error propagation from the measured observables. As an illustration, we show in figure 
|3| the result of the 'li' parameterization for B max = 20,6°°. Increasing the upper bound on 
B 00 will not change the final distribution, but will result in more samples being thrown away 
as incompatible with the constraints on the system. Figure [3| (right) shows a histogram of 
the samples of B 00 and C 00 that were retained. It shows the probabilistic constraints on 
B 00 and C 00 due to the observations and the assumption of isospin symmetry. 

4. Conclusions 

In the debate concerning the analysis of the CKM phase a problem we have contributed two 
important points. The first is a formulation of the problem that is invariant to the choice 
of parameterization. The second is the correct interpretation of the posterior marginal 
distribution for a as a representation of our state of knowledge. 

In the CKM Phase a problem the relationships between the parameters of the model 
and the observables is deterministic. In this case the appropriate statistical technique to 
find the distribution over the model parameters is that of the transformation-of- variables. 
This gives us a distribution over the model parameters that summarizes our state of knowl- 
edge. It does not, and cannot, tell us if our model is true or false. We have no way of 
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knowing the actual mechanisms of the external universe. We can only generate models of 
the universe and use data to cast light on these models. However "true" we may think 
our models are today, better models will certainly be developed tomorrow. The scientific 
method is composed of the cycle of model formulation, testing against observations, and 
model revision and development. Bayesian statistics provides many tools to facilitate this 
process. 
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A. Reparameterization invariance of the marginal posterior pdf over a 

We consider a system of N random variables Aj {i = 1...N), which are related to a set of 
N observables Oi as Oi = /i(X). We also assume that it is possible to reparameterize the 
variables Aj into a set Y{ so that X\ = Y\ = a, Yi = <fo(X), and Oi = <?«(Y). Within the 
Bayesian framework, we consider a dataset d used to estimate the observables, which yields 
the posterior pdf po(o\d). Under the further hypothesis that /, g and 4> are invertible, we 
can write the marginal posterior on a using the parameterization Y as: 

Pa(a\d) = J ... J po(o\d)\J g \dy 2 ...dy N asp x (x|cf) =p (o\d)\J g \ (A.l) 

= J ... J po(o\d)\J g \\J < j > \dx 2 ...dx N (A.2) 

= J ... J po(o\d)\Jf\dx 2 ...dxN (A. 3) 

= i£(a\d) as p Y (y\d) = po(o\d)\J f \, (A.4) 

proving that the marginal posterior on a is parameterization invariant. Thus, if a Bayesian 
analysis has been performed on the dataset d so that the posterior pdf on the observables 
is known, the marginal posterior on a obtained by the change of variables Yi = 4>i(X.) is 
invariant under reparameterization of the N — 1 marginalized variables Aj, i = 2 . . . N. 

B. Parameterizing the CKM Phase a Problem 

We give details here of the three parameterizations, the Pivk-LeDiberder (PLD), the Ex- 
plicit Solution (ES) and the li parameterizations. 

B.l The Pivk-LeDiberder Parameterization 

PLD introduces six parameters, a, a e ff,fj,, a, a, A, via 

A + - = fia , A + - = fiae 2ia eff 
A+° = /[ /e i ( A -°) , A +0 = ^ A+a ) (B.l) 

A 00 = ue^ A - a ^ ( 1 - ^e-^-A , A 00 = ue i(A+a ^ ( 1 - A e -*(A+ Q -2a e// A 



ue y Ml -=e K ' , Ji = ae 

J V V2 
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which results in 



B +- = c^V (a 2 + « 2 ) 

(2—2 \ 
2 + ° 2 " v^acos (A - a) + a cos (A + a - 2a e// ))J 

£+° = Cr B V 
a 2 - a 2 

C+- = fL-JL (B.2) 
\/2(acos (A — a) — a cos (A + a — 2a e ff)) 



a 2 + a 2 



oo 



2 + - ^2(acos (A - a) + a cos (A + a - 2a e //)) 



j aa 

d =2^ ttt sm 2a e f /• 

a z + a z 

where C = (167rM^/i) _1 . This system can be solved to give 

2 B+» 

a 2 = K(l + C + -) 
a 2 = K(l-C+-) 

sm2a & ff = — = sins (B.3) 

n V /1-(C+-) 2 V ' 

(l + C+-)^-2^(l + C 00 )+2 

cos (A — a) = : = cos t 

2 y / 2K(l + C+-) 

(1 - C+-)K - 2K&£=(1 - C 00 ) + 2 

cos (A + a — 2a e ff) = , = cos u 

y n ' 2y/2K{l-C+-) 

13^ — T 

where we define K = , and s,t and u as in the final three equations. The fourth 

equation yields 2a e // = s or 2a e ff = ir — s. The final two equations yield A + a = et + s 
orA + a = et + 7r — s and A — a = e'u or A + a = et + tt — s, respectively, where e, e' = ±1. 
Finally, we obtain a = et + e'u + s or a = et + e'u + ir — s as the 8 solutions corresponding 
to each set of values of the observables. 



B.2 The Explicit Solution Parameterization 

The Explicit Solution (ES) parameterization [jlT]] begins with the same parameters as the 
PLD parameterization, and then defines 

c = cos(</>), (j) = a — A 

c = cos(0), ifi = a + A — 2a e ff 

and also s = sin(a), s = sin(^). Using the identity 2a = 2a e ff + <f> + <t> allows the following 
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solution to be derived. 

tan a 



sm(2a e ff)c + cos(2a e jj)s + s 
cos(2a e ff)c — sin(2a e f f)s + c 

s + - 



Vi - c+- 



2 



sin(2a e// ) 

cos(2a e// ) = ±^/l-sin 2 (2a e// ) 

, r — ^6 +0 + + C+-)/2 - B 00 (l + C 00 ) 



c = 



c = 



T B+ T B+ 



^B +0 + B + -(1 - C+-)/2 - £ 00 (1 - C7 



00 \ 



r B o y/2B+-B+®{l - C+- 



s = ±\A - c 2 



s 



±\A - c 2 (B.4) 
where the 8 solutions in the range [0, tt] are apparent from the three arbitrary signs. 

B.3 The li Parameterization 

Botella and Nebot introduce the following parameterization 

A+- = e~ ia T 3/2 (T + iP) 
^2A+° = e~ ia T 3/2 

A +~ = e +i<*T z/2 {T-iP) 
^A 00 = e~ ia T 3/2 (l -T-iP) 
^2A +0 = e +ia T 3/2 
y/2 A 00 = e +ia T 3/2 (l -T-iP) 

and writing T and P in terms of real and imaginary parts allows the system of equations 
for the observables to be inverted in terms of a, T 3 / 2 , T r , Ti, P r , Pi, in the following way : 



T = 



2B+° 



Pi 



\r B+ C 

2B +0 t b o + (B+--2B 00 ) r B+ 

AB+°t b o 
(2 goo C°° - B+- C+~) t b+ 
4B+°t b o 

(Ti + P r ) 2 = 2 £+0 Tbo(1 +V-) " ^ " ^ 
{Ti - P r f = 2B+ Tb0 ^ C +-) ~ ^ + Pi)2 

( Vb 2 + a 2 -c 2 + a\ 
a = arctan ± 

V c ~ b J 

with a = (T 2 - Pf + T 2 - P 2 ), b = 2PiTi + 2P r T r , c = S+-B+-t b+ /(2t b oB +0 ) 
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